Planning with Partially Specified Behaviors
نویسندگان
چکیده
In this paper we present a framework called PPSB for combining reinforcement learning and planning to solve sequential decision problems. Our aim is to show that reinforcement learning and planning complement each other well, in that each can take advantage of the strengths of the other. PPSB uses partial action specifications to decompose sequential decision problems into tasks that serve as an interface between reinforcement learning and planning. On the bottom level, we use reinforcement learning to compute policies for achieving each individual task. On the top level, we use planning to produce a sequence of tasks that achieves an overall goal. Experiments show that our framework is competitive with realistic environments where a robot has to perform some tasks.
منابع مشابه
On the Synthesis of Situation Control Rules under Exogeneous Events
One approach for computing plans for reactive agents is is to check goal statements over state trajectories modeling predicted behaviors of an agent. This paper describes a powerful extension of this approach to handle time, safety, and liveness goals that are specified by Metric Temporal Logic formulas. Our planning method is based on an incremental planning algorithm that generates a reactive...
متن کاملSelf-Efficacy and Planning Predict Dietary Behaviors in Costa Rican and South Korean Women: Two Moderated Mediation Analyses
Dietary planning is supposed to mediate between intentions and dietary behaviors. However, if a person lacks self-efficacy, this mediation might fail. A cross-sectional study in Costa Rica and a longitudinal study in South Korea were designed to examine the moderating role of self-efficacy in the intention– planning–behavior relationship. Intentions, planning, self-efficacy, dietary behaviors, ...
متن کاملIntentions, planning, and self-efficacy predict physical activity in Chinese and Polish adolescents: Two moderated mediation analyses1
Planning is assumed to translate intentions into health behaviors. However, this may fail due to a lack of perceived self-efficacy. People do not tackle challenging tasks if they harbor self-doubts, even if they have made a good action plan. The present two descriptive longitudinal studies are designed to examine the putative moderating role of self-efficacy in the planning-behavior relationshi...
متن کاملDecomposition and Causality in Partial-order Planning
We describe DPOCL, a partinl-order csnsal llnk planner that includes action decomposition. DPOCL builds directly on the SNLP algorithm (McAllester Rosenbiltt 1991), and hence is clear and simple, ud can readily be integrated with other SNLP extensions. In addition, DPOCL is specifically designed to handle partially specified action decompositions. Plan generation in DPOCL exploits the planner’s...
متن کاملAutomated Hierarchy Discovery for Planning in Partially Observable Environments
Planning in partially observable domains is a notoriously difficult problem. However, in many real-world scenarios, planning can be simplified by decomposing the task into a hierarchy of smaller planning problems. Several approaches have been proposed to optimize a policy that decomposes according to a hierarchy specified a priori. In this paper, we investigate the problem of automatically disc...
متن کامل